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PATENT 



Amendments to the Specification: 

Please replace paragraph [0026] beginning at page 8, line 22, with the following: 

-[0026] A consensus sequence may be represented by a formula, comprising both fixed 
and variable bases. Thus, the consensus sequence for the signature motif for the family of 
nuclear hormone receptors mentioned above and shown in Example 1 is: 

[ (A/T/G) (C/T) (A/G/T/C) ] [TG (T/C) ] [ (A/G) (A/G) (A/C/G/T) ] 
[ (A/G) (C/G) (A/C/G/T) ] [TG (T/C) ] [ (A/T) (A/C/G) (A/C/G) ] 
[(A/G) (C/G/T) (A/C/G/T)] (SEOIDNO:28n 

As can be seen, among the variable bases, some may be fully random , i.e., they may be any of 
the four possible bases, A, C, G or T. Others may be partially random, i.e., they may comprise 
only two or only three predetermined bases of the four possible bases. Generally, in determining 
a consensus sequence, all possible codon variations for a given amino acid will be taken into 
account; however, for various reasons, including the need to limit the complexity {i.e. size) of the 
siRNA library, the consensus sequence may be restricted to include only the specific codons 
known to code for the amino acids comprising the known members of the protein family. 

Please replace paragraph [0040] (and Table 1) beginning at page 14, line 8, with the following: 

-[0040] An additional and effective way to reduce the complexity of a library is to divide 
the members of a protein family of interest into two or more sub-sets, each sub-set comprising 
members having a variant of the signature sequence, each such variant comprising a relatively 
high number of amino acids that are constant for all members of the sub-set. The effect of such 
division can be seen clearly with reference to Example 2 and Table 1 below, which shows the 
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division into seven sub-sets of the 89 known members of the family of tyrosine kinases. Each of 
sub-sets 1 and 4-7 have a different variant of the signature motif, but all five comprise seven 
amino acids that are constant for all members of the respective sub-set. Sub-set 3 has a variant 
signature sequence in which only one of the seven amino acids is not constant for all members of 
the sub-set; and only sub-set 2 has a variant signature motif in which three of the amino acids are 
not constant for all members. 



Table 1 



Variant 


Signature Motif 


SEQ DD 

NO: 


No. of 
Known 
Members 


Complexity 


1 


H 


R D 


L 


K 


S 


S 


282 


3 


4 


2 


H 


R N/D 


L/V/I 


A 


A/V 


R 


283 


3 


2,304 


3 


H 


R D 


L 


R 


A/S 


A 


284 


8 


10,368 


4 


H 


R/K D 


L 


A 


T 


R 


285 


9 


2,592 


5 


H 


R D 


L 


A 


A 


R 


286 


61 


8,192 


6 


H 


K D 


L 


A 


A 


R 


287 


3 


576 


7 


H 


R D 


I 


A 


A 


R 


288 


2 


32 


Total 


89 


24,068 
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Please replace paragraph [0050] beginning at page 18, line 2, with the following: 



-[0050] The symbols for amino acids used in the examples are as 


A Alanine 


M 


Methionine 


C Cysteine 


N 


Asparagine 


D Aspartic acid 


P 


Proline 


E Glutamic acid 


Q 


Glutamine 


F Phenylalanine 


R 


Arginine 


G Glycine 


S 


Serine 


H Histidine 


T 


Threonine 


I Isoleucine 


V 


Valine 


K Lysine 


W 


Tryptophan 


L Leucine 


Y 


Tyrosine 



Example 1 

Family of human nuclear hormone receptors (ZnF_C4 domain) - 45 members 

In this example, a single signature motif was designed based on the zinc finger domain present in 
all 45 known members of the nuclear hormone receptor family. A short segment of the zinc 
finger domain present in each of the 45 known family members is shown below. The consensus 
sequence was "reverse translated" utilizing only those codons that encode the signature motif 
region of known members of the family. Using a full 21 -nucleotide consensus sequence to 
construct the siRNA library, the complexity would be 10,616,832. By reducing the length of the 
consensus sequence to 19 nucleotides, the complexity is reduced to 884,736. SiRNAs as short as 
19 nucleotides are highly efficient at reducing their cognate mRNA levels [Czauderna, F. et aL, 
Nucl Acids Res. 31: 2705-2716 (2003)], therefore, reducing the length of the consensus 
sequence will have little, if any, effect on the degree of silencing produced by members of the 
library. 



Page 5 of 32 



Appl. No. 10/776,399 
Amdt. dated June 25, 2004 

Reply to Notice to File Missing Parts of May 18, 2004 

tataatgcactgacctgtgaggggtgtaaaggtttcttcaggaga 

Y N A L T C E G C K G F F R R 

tacggcgtgcgcacctgtgagggctgcaaaggcttctttaagcgc 

Y G V R T C E G C K G F F K R 

tacggcgtgcgcacctgtgagggctgcaaaggcttctttaagcgc 

Y G V R T C E G C K G F F K R 

tacggcgtgcgaacctgcgagggctgcaagggctttttcaagaga 

Y G V R T C E G C K G F F K R 

tatggtgtccgcacatgtgagggctgcaagggcttcttcaagcgc 

Y G V R T C E G C K G F F K R 

tatggagcagtaacttgtgaaggctgcaaaggattttttaaaaga 

Y G A V T C E G C K G F F K R 

tacggggttatcacctgtgaggggtgcaagggcttcttccgccgg 

Y G V I T C E G C K G F F R. R 

tacggagtcatcacatgtgaaggctgcaagggattctttaggagg 

Y G V I T C E G C K G F F R R 

tatggtgtcattacatgtgaaggctgcaagggctttttcaggaga 

Y G V I T C E G C K G F F R R 

tatggagtgtacagctgcgaggggtgcaagggcttcttcaagcgg 

Y G V Y S C E G C K G F F K R 

tacggggtttacagctgtgagggttgcaagggcttcttcaaacgc 

Y G V Y S C E G C K G F F K R 

tacggggtatacagttgtgaaggctgcaaagggttcttcaagagg 

Y G V Y S C E G C K G F F K R 

tacaacgtgctcagctgcgaaggctgcaagggcttcttccggcgc 

Y N V L S C E G C K G F F R R 

tacaatgttctgagctgcgagggctgcaagggattcttccgccgc 

Y N V L S C E G C K G F F R R 

tatgggatcatctcctgtgagggctgcaaagggtttttcaagcgg 
YGIISCEGCKGFFKR 



PATENT 



SEQIDNO:l) 
SEQ ID NO:2) 

SEQ ID NO:3) 
SEQ ID NO:4) 

SEQ ID NO:5) 
SEQ ED NO:6) 

SEQ ED NO:7) 
SEQ ED NO:8) 

SEQ ID NO:9) 
SEQ ID NO: 10) 

SEQ ID NO: 11) 
SEQ ID NO: 12) 

SEQ ID NO: 13) 
SEQ ID NO: 14) 

SEQ ID NO: 15) 
SEQ ID NO: 16) 

SEQ ID NO: 17) 
SEQ ID NO: 18) 

SEQ ID NO: 19) 
SEQ ID NO:20) 

SEQ ID NO:21) 
SEQ ID NO:22) 

SEQ ID NO:23) 
SEQ ID NO:24) 

SEQ ID NO:25) 
SEQ ID NO:26) 

SEQ ID NO:27) 
SEQ ID NO:28) 

SEQ ID NO:29) 
SEQ ID NO:30) 
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tatggggtcagctcttgtgaaggctgcaagggcttctttcgccga 

Y G V S S C E G C K G F F R R 

tatggggtcagctcttgtgaaggctgcaagggcttctttcgccga 

Y G V S S C E G C K G F F R R 

tatggggctgtcagttgtgaaggttgcaaaggtttcttcaaaagg 

Y G A V S C E G C K G F F K R 

tacggtgtcttcacctgcgagggctgcaagagctttttcaagcga 

Y G V F T C E G C K S F F K R 

tacggccagttcacgtgcgagggctgcaagagcttcttcaagcgc 

Y G Q F T C E G C K S F F K R 

tacggggtctacgcctgcgacggctgctcaggttttttcaaacgg 

Y G V Y A C D G C S G F F K R 

tatggcatctatgcctgcaacggctgcagcggcttcttcaagagg 

Y G I Y A C N G C S G F F K R 

tatggggcatccacctgtgatgggtgcaagggtttcttcagacgc 

Y G A S T C D G C K G F F R R 

tacggtgcctcgagctgtgacggctgcaagggcttcttccggagg 

Y G A S S C D G C K G F F R R 

tatggggtcagcgcctgtgagggctgcaagggcttcttccgccgc 

Y G V S A C E G C K G F F R R 

tatggggtcagcgcctgtgagggatgtaagggctttttccgcaga 

Y G V S A C E G C K G F F R R 

tacggtgtgcacgcctgcgagggctgcaagggctttttccgtcgg 

Y G V H A C E G C K G F F R R 

tatggagttcatgcttgcgaaggctgtaagggtttctttcggaga 

Y G V H A C E G C K G F F R R 

tacggtgttcatgcatgtgaggggtgcaagggcttcttccgtcgt 
YGVHACEGCKGFFRR 



PATENT 



SEQIDNO:31) 
SEQ ID NO:32) 

SEQ ID NO:33) 
SEQ ID NO:34) 

SEQ ID NO:35) 
SEQ ID NO:36) 

SEQ ID NO:37) 
SEQ ID NO:38) 

SEQ ID NO:39) 
SEQ ID NO:40) 

SEQIDNO:41) 
SEQ ID NO:42) 

SEQ ID NO:43) 
SEQ ID NO:44) 

SEQ ID NO:45) 
SEQ ID NO:46) 

SEQ ID NO:47) 
SEQ ID NO:48) 

SEQ ID NO:49) 
SEQIDNO:50) 

SEQIDNO:51) 
SEQ ID NO: 52) 

SEQ ID NO:53) 
SEQ ID NO:54) 

SEQ ID NO:55) 
SEQIDNO:56) 

SEQIDNO:57) 
SEQ ID NO:58) 
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tacggagtccacgcgtgtgaaggctgcaagggcttctttcggcga 

Y G V H A C E G C K G F F R R 

tatggagttcatgcttgtgaaggatgcaagggtttcttccggaga 

Y G V H A C E G C K G F F R R 

ttcaatgtcatgacatgtgaaggatgcaagggctttttcaggagg 
F N V M T C E G C K G F F R R 

tttaatgcgctgacttgtgagggctgcaagggtttcttcaggaga 
F N A L T C E G C K G F F R R 

taccgctgtatcacgtgtgaaggctgcaagggtttctttagaaga 

Y R C I T C E G C K G F F R R 

taccgctgtatcacttgtgagggctgcaagggcttctttcgccgc 

Y R C I T C E G C K G F F R R 

tacggactgctcacgtgtgagagctgcaagggcttcttcaagcgc 

Y G L L T C E S C K G F F K R 

tatgggctcctcacctgtgaaagctgcaagggattttttaagcga 

Y G L L T C E S C K G F F K R 

tatggggtagtcacctgtggcagctgcaaagt'tttcttcaaaaga 

Y G V V T C G S C K V F F K R 

tatggagctctcacatgtggaagctgcaaggtcttcttcaaaaga 

Y G A L T C G S C K V F F K R 

tatggtgtccttacctgtgggagctgtaaggtcttctttaagagg 

Y G V L T C G S C K V F F K R 

tatggagtcttaacttgtggaagctgtaaagttttcttcaaaaga 

Y G V L T C G S C K V F F K R 

tacggcgtggcctcctgcgaggcttgcaaggccttcttcaagagg 

Y G V A S C E A C K A F F K R 

tatggtgtggcatcctgtgaggcctgcaaagccttcttcaagagg 
YGVASCEAC KAFFKR 



SEQ ID NO:59) 
SEQ ID NO:60) 

SEQJDNO:61) 
SEQ ID NO:62) 

SEQ ID NO:63) 
SEQ ID NO:64) 

SEQ ID NO:65) 
SEQ ID NO:66) 

SEQ ID NO:67) 
SEQ ID NO:68) 

SEQ ID NO:69) 
SEQ ID NO:70) 

SEQ ID NO:71) 
SEQ ID NO:72) 

SEQ ID NO:73) 
SEQ ID NO:74) 

SEQ ID NO:75) 
SEQ ID NO:76) 

SEQ ID NO:77) 
SEQIDNO:78) 

SEQ ID NO:79) 
SEQIDNO:80) 

SEQIDNO:81) 
SEQ ID NO:82) 

SEQ ID NO:83) 
SEQ ID NO:84) 

SEQ ID NO:85) 
SEQ ID NO:86) 
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tatggagtctggtcctgtgagggctgcaaggccttcttcaagaga (SEQ ID NO:87) 

Y G V W S C E G C K A F F K R (SEQ ID NO:88) 

tatggagtctggtcgtgtgaaggatgtaaggccttttttaaaaga (SEQ ID NO:89) 

Y G V W S C E G C K A F F K R (SEQ ID NO:90) 



Signature Motif: 

(T/S/A)-C-(D/E/G/N)-(G/S/A)-(C)-(K/S)-(A7G/SA^) 



Consensus sequence (21 nt USEO ID NO;281 : 

( A/T/G) (C/T) (A/G/T/C ) TG(T/C ) ( A/G) (A/G) (A/C/G/T ) 
T/S/A C D/E/G/N 

( A/G) (C/G) (A/C/G/T ) TG(T/C ) ( A/T) (A/C/G) (A/C/G ) 
G/S/A C K/S 

( A/G) (C/G/T) (A/C/G/T ) 
G/S/V/A 



Complexity: 2 9 x 3 4 x 4 4 = 512 x 81 x 256 = 10,616,832 members 



Consensus sequence (19 nfl (SEP ID NO:289) : 

( A/T/G) (C/T) (A/G/T/C ) TG(T/C ) ( A/G) (A/G) (A/C/G/T ) 
T/S/A " C D/E/G/N 

( A/G) (C/G) (A/C/G/T ) TGjT/C) ( A/T) (A/C/G) (A/C/G ) 
G/S/A C K/S 

( A/G) -- 
G/S/V/A 



Complexity: 2 9 x 3 3 x 4 3 = 512 x 27 x 64 - 884,736 members 
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Example 2 



Family of tyrosine kinases - 89 members 

This example shows the identification of seven variants of a portion of the catalytic 
domain of the family of tyrosine kinases. As shown in Table 1 above, these may then be used 
for the production of library of siRNAs targeting this domain having a reduced complexity of 
24,068 unique members. 

Variant 1: 3 members 

gttcccatcatccaccgcgaccttaagtccagcaacatattgatcctc 

V P I I H R D L K S S N I L I L 

gtgcccatcctgcaccgggacctcaagtccagcaacattttgctactt 

V P I L H R D L K S S N I L L L 

gtgcccatcctgcaccgggacctcaagtccagcaacattttgctactt 
VPILHRDLKSSNILLL 



(SEQIDNO:91) 
(SEQ ED NO:92) 

(SEQ ID NO:93) 
(SEQ ID NO:94) 

(SEQ ID NO:95) 
(SEQ ID NO:96) 



Signature Moti f (SEP ID NO:282) : H R D L K S S 

Consensus Sequence (SEP ID NO:290) : 

CAC CG(C/G) GAC CT(C/T) AAG TCC AGC 
H R D L K S S 

Complexity: 2 2 = 4 members 



Variant 2: 3 members 

catggtatggtgcatagaaacctggctgcccgaaacgtgctactcaag (SEQ ID NP:97) 

H G M V H R N L A A R N V L L K (SEQ ED NP:98) 

aagaattgcatccaccgggacgtggcagcgcgtaacgtgctgttgacc (SEQ ID NP:99) 

K N C I H R D V A A R N V L L T (SEQ ID NP: 1 00) 

atcaactgcgtgcacagggacattgctgtccggaacatcctggtggcc (SEQ ED NP:101) 

I N C V H R D I A V R N I L V A (SEQ ED NP:102) 
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Signature Moti f (SEP ID NO:283) : H R D/N I/V/L A A/V R 
Consensus Sequenc e fSEP ID NO:291) : 

CA(T/C) (C/A)G(G/A) (G/A)AC (A/C/G) T (T/G) GC (T/A) G(T/C) (C/G) 
H R D/N) (I/V/L) A (A/V) 

CG(A/T/G) 
R 

Complexity: 2 8 x 3 2 = 256 x 9 = 2,304 members 



Variant 3: 8 members 



atgaactacgtccaccgggaccttcgtgcagccaacatcctggtggga 
M N Y V H R D L R A A N I L V G 

atgaactatattcaccgagatcttcgggctgctaatattcttgtagga 
M N Y I H R D L R A A N I L V G 

atgaattatatccatagagatctgcgatcagcaaacattctagtgggg 
M N Y I H R D L R S A N I L V G 

atgaactacattcaccgcgacctgagggcagccaacatcctggttggg 
M N Y I H R D L R A A N I L V G 

aagaattccatccaccgcgacctgcgggcggccaacatcctggtgtct 
M N S I H R D L R A A N I L V S 

aggaactacatccaccgagacctccgagctgccaacatcttggtctt 
R N Y I H R D L R A • A N I L V S 

aagaactacattcaccgggacctgcgagcagctaatgttctggtctcc 
K N Y I H R D L R A A N V L V S 

cggaattatattcatcgtgaccttcgggctgccaacattctggtgtct 
RNYI HRDLRAANI LVS 



SEQIDNP:103) 
SEQJDNO:104) 

SEQIDNO:105) 
SEQIDNO:106) 

SEQJDNO:107) 
SEQIDNO:108) 

SEQIDNO:109) 
SEQIDNO:110) 

SEQIDNO:lll) 
SEQIDNO:112) 

SEQIDNO:113) 
SEQIDNO:114) 

SEQIDNO:115) 
SEQIDNO:116) 

SEQJDNO:117) 
SEQIDNO:118) 



Signature Moti f (SEP ID NO:2841 : HRDLRA/SA 
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Consensus Sequence (SEP ID NO:292V 

CA(C/T) (C/A)G(A/C/G/T) GA(C/T) CT(C/G/T) (A/C) G (A/G/T) 
H R D " L R 

(G/T)C (A/G/T) GC(A/C/T) 
(A/S) ~~ ~ A 

Complexity: 2 5 x 3 4 x 4= 32 x 81 x 4 = 10,368 members 



Variant 4: 9 Members 

ctgcattttgtgcaccgggacctggccacacgcaactgtctagtggg 
L H F V H R D L A T R N C L V G 

ctcaactttgtacatcgggacctggccacgcggaactgcctagttggg 
L N F V H R D L A T R N C L V G 

cttaattttgttcaccgagatctggccacacgaaactgtttagtgggt 
L N F V H R D L A T R N C L V G 

cgcgggctggtgcaccgagacctcgctacgcgcaacctactgctggcg 
R G L V H R D L A T R N L L L A 

aaaaggtatatccacagggatctggcaacgagaaatatattggtggag 
K R Y I H R D L A T R N I L V E 

cagcactttgtgcaccgagacctggccaccaggaactgcctggttgga 
Q H F V H R D L A T R N C L V G 

cagcacttcgtgcaccgcgatttggccaccaggaactgcctggtcggg 
Q H F V H R D L A T R N C L V G 

caccacgtggttcacaaggacctggccacccgcaatgtgctagtgtac 
H H V V H K D L A T R N V L V Y 

cgtaagtttgttcaccgagatttagccaccaggaactgcctggtgggc 
RKFVHRDLATRNCLVG 



SEQJDNO:119) 
SEQIDNO:120) 

SEQIDNO:121) 
SEQIDNO:122) 

SEQDDNO:123) 
SEQIDNO:124) 

SEQIDNO:125) 
SEQIDNO:126) 

SEQJDNO:127) 
SEQE)NO:128) 

SEQIDNO:129) 
SEQIDNO:130) 

SEQIDNO:131) 
SEQIDNO:132) 

;SEQIDNO:133) 
;SEQIDNO:134) 

;SEQIDNO:135) 
;SEQIDNO:136) 
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Signature Moti f (SEP ID NO:2851 : H R/K D L A T 
Consensus Sequence (SEP ID NO:293) : 



PATENT 



R 



CAC (A/C) (A/G) (A/C/G) GA(C/T) 
H R/K ' D 

AC (A/C/G) ( A/C) G (A/C/G) 
T R 



(C/T)T (A/C/G) GC (A/C/T) 



Complexity: 2 3 x 3 5 = 32 x 81 = 2,592 members 



Variants: 61 members 

aagaagcttgtgcaccgcgacctggccgcccgcaacatcctggtctca 
K K L V H R D L A A R N I L V S 

aagaagcttgtgcaccgggacctagccgcccgcaacatcctggtctca 
K K L V H R D L A A R N I L V S 

aacaatttcgtgcatcgagacctggctgcccgcaatgtgctggtgtct 
N N F V H R D L A A R N V L V S 

cacgactacatccaccgagacctagccgcgcgcaacgtgctgctggac 
H D Y I H R D L A A R N V L L D 

cggcaatacgttcaccgggacttggcagcaagaaatgtccttgttgag 
R Q Y V H R D L A A R N V L V E 

cgtcgcttggtgcaccgcgacctggcagccaggaacgtactggtgaaa 
R R L V H R D L A A R N V L V K 

cggaacttcatccaccgagacctggctgctcggaattgcatgctggca 
R N F I H R D L A A R N C M L A 

aagaagtgcatacaccgagacctggcagccaggaatgtcctggtgaca 
K K C I H R D L A A R N V L V T 

caaaaatgtattcatcgagatttagcagccagaaatgttttggtaaca 
Q K C I H R D L A A R N V L V T 

cagaagtgcatccacagggacctggctgcccgcaatgtgctggtgacc 
Q K C I H R D L A A R N V L V T 



SEQIDNP:137 
SEQIDNP:138 

SEQIDNP:139 
SEQK>NP:140 

SEQK>NP:141 
SEQIDNP:142 

SEQIDNP:143 
SEQIDNP:144 

SEQIDNP:145 
SEQIDNP:146 

SEQIDNP:147 
SEQH)NP:148 

SEQIDNP:149 
SEQIDNP:150; 

SEQIDNP:151 
SEQIDNP:152 

SEQIDNP:153 
SEQIDNP:154 

SEQIDNP:155 
SEQIDNP:156 
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cagaagtgtattcacagagacttggctgccagaaacgtcctggtgacc 
Q K C I H R D L A A R N V L V T 

cggaagtgtatccaccgggacctggctgcccgcaatgtgctggtgact 
R K C I H R D L A A R N V L V T 

atgaagctcgttcatcgggacttggcagccagaaacatcctggtagct 
M K L V H R D L A A R N I L V A 

agaaagtgcattcatcgggacctggcagcgagaaacattcttttatct 
R K C I H R D L A A R N I L L S 

cgaaagtgcatccacagagacctggctgctcggaacattctgctgtcg 
R K C I H R D L A A R N I L L S 

cgaaagtgtatccacagggacctggcggcacgaaatatcctcttatcg 
R K C I H R D L A A R N I L L S 

aagaactgcgtccacagagacctggcggctaggaacgtgctcatctgt 
K N C V H R D L A A R N V L I C 

aaaaattgtgtccaccgtgatctggctgctcgcaacgtcctcctggca 
K N C V H R D L A A R N V L L A 

aagaattgtattcacagagacttggcagccagaaatatcctccttact 
K N C I H R D L A A R N I L L T 

aagtcgtgtgttcacagagacctggccgccaggaacgtgcttgtcacc 
K S C V H R D L A A R N V L V T 

aaacagtttattcacagggacctagctgccaggaacattttagttggt 
K Q F I H R D L A A R N I L V G 

aagcagttcatccacagggacctggctgcccggaatgtgctggtcgga 
K Q F I H R D L A A R N V L V G 

aagcagttcatccacagggacctggctgcccggaatgtgctggtcgga 
K Q F I H R D L A A R N V L V G 

atgaactatgtgcaccgtgacctggctgcccgcaacatcctcgtcaac 
MNYVHRDLAARNI LVN 



SEQIDNO:157 
SEQIDN0:158 

SEQIDNO:159 
SEQIDNO:160; 

SEQIDNO:161 
SEQIDNO:162 

SEQIDNO:163 
SEQIDNO:164; 

SEQIDNO:165 
SEQIDNO:166 

SEQIDNO:167 
SEQIDNO:168 

SEQIDNO:169 
SEQIDNO:170 

SEQIDNO:171 
SEQIDNO:172 

SEQIDNO:173 
SEQIDNO:174 

SEQIDNO:175 
SEQIDNO:176; 

SEQIDNO:177 
SEQIDNO:178 

SEQIDNO:179 
SEQIDNO:180; 

SEQIDNO:181 
SEQE)NO:182 

SEQIDNO:183 
SEQIDNO:184; 
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atgcatt teat tcacagggatctggcagctagaaattgccttgtt tec 
M H F I H R D L A A R N C L V S 

aacaagtttgtgcaccgagatctagcagcccgcaactgcatggtgtcc 
N K F V H R D L A A R N C M V S 

aataagttcgtccacagagaccttgctgcccggaattgcatggtagcc 
N K F V H R D L A A R N C M V A 

aagaagtttgtgcatcgggacctggcagcgagaaactgcatggtcgcc 
K K F V H R D L A A R N C M V A 

aagagattcatacaccgggacctggcggccaggaactgcatgctgaat 
K R F I H R D L A A R N C M L N 

atgaactatgttcaccgtgacctggctgcccgcaacatcctcgtcaac 
M N Y V H R D L A A R N I L V N 

atgaactatgtgcaccgcgacctggctgctcgcaacatccttgtcaac 
M N Y V H R D L A A R N I L V N 

atgaattatgtgcatcgggacctggctgctaggaacattctggtcaac 
M N Y V H R D L A A R N I L V N 

atgggctatgtgcatagagatcttgctgccagaaacatcttaatcaac 
M G Y V H R D L A A R N I L I N 

cagaagtttgtgcacagggacctggctgcgcggaactgcatgctggac 
Q K F V H R D L A A R N C M L D 

aaaaagtttgtccacagagacttggctgcaagaaactgtatgctggat 
K K F V H R D L A A R N C M L D 

atgggctatgttcaccgagacctcgctgctcggaacatcttgatcaac 
M G Y V H R D L A A R N I L I N 

aggaattttcttcatcgagatttagctgctcgaaactgcatgttgcga 
R N F L H R D L A A R N C M L R 

aaaaactgtatacacagggaccttgctgcaagaaactgcctggtaggt 
KNC IHRDLAARNCLVG 



SEQIDNO:185 
SEQ ID NO: 186 

SEQ ID NO: 187 
SEQ ID NO: 188 

SEQ ID NO: 189 
SEQ ID NO: 190 

SEQIDNO:191 
SEQ ID NO: 192 

SEQ ID NO: 193 
SEQ ID NO: 194 

SEQIDNO:195 
SEQ ID NO: 196 

SEQ ED NO: 197 
SEQIDNO:198 

SEQ ID NO: 199 
SEQ ID NO:200 

SEQ ID NO:201 
SEQ ID NO:202 

SEQ ID NO:203 
SEQ ID NO:204 

SEQ ID NO:205 
SEQ ID NO:206 

SEQ ID NO:207 
SEQ ID NO:208 

SEQ ID NO:209 
SEQIDNO:210; 

SEQIDNO:211 
SEQ ID NO:212 
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aagtgctgcatccaccgggacctggctgctcggaactgcctggtgaca 
K C C I H R D L A A R N C L V T 

atgagctatgtgcatcgtgatctggccgcacggaacatcctggtgaac 
M S Y V H R D L A A R N I L V N 

atgagctacgtccaccgagacctggctgctcgcaacatcctagtcaac 
M S Y V H R D L A A R N I L V N 

atgggctatgttcaccgagacctcgctgctcggaacatcttgatcaac 
M G Y V H R D L A A R N I L I N 

atgggatatgttcacagggaccttgcagctcgcaatattcttgtcaac 
M G Y V H R D L A A R N I L V N 

cgtcgcttggtgcaccgcgacctggcagccaggaacgtactggtgaaa 
R R L V H R D L A A R N V L V K 

gtgcggctcgtacacagggacttggccgctcggaacgtgctggtcaag 
V R L V H R D L A A R N V L V K 

agacgactcgttcatcgggatttggcagcccgtaatgtcttagtgaaa 
R R L V H R D L A A R N V L V K 

aaaaacttcatccacagagatcttgctgcccgaaactgcctggtaggg 
K N F I H R D L A A R N C L V G 

aagcgctttattcaccgtgacctggctgcccgcaatctgctgttggct 
K R F I H R D L A A R N L L L A 

aagaactttgtgcaccgtgacctggcggcccgcaacgtcctgctggtt 
K N F V H R D L A A R N V L L V 

aagaactttgtgcaccgtgacctggcggcccgcaacgtcctgctggtt 
K N F V H R D L A A R N V L L V 

agcaattttgtgcacagagatctggctgcaagaaatgtgttgctagtt 
S N F V H R D L A A R N V L L V 

cagaattacatccaccgggacctggccgccaggaacatcctcgtcggg 
Q N Y I H R D L A A R N I L V G 



PATENT 



SEQEDNO:213) 
SEQEDNO:214) 

SEQEDNO:215) 
SEQ ID NO:216) 

SEQEDNO:217) 
SEQEDNO:218) 

SEQEDNO:219) 
SEQ ID NO:220) 

SEQEDNO:221) 
SEQ ID NO:222) 

SEQ ID NO:223) 
SEQ ID NO:224) 

SEQ ID NO:225) 
SEQ ID NO:226) 

SEQEDNO:227) 
SEQ ID NO:228) 

SEQEDNO:229) 
SEQ ID NO:230) 

SEQEDNO:231) 
SEQ ID NO:232) 

SEQ ED NO:233) 
SEQ ED NO:234) 

SEQ ED NO:235) 
SEQ ED NO:236) 

SEQ ED NO:237) 
SEQ ED NO:238) 

SEQ ED NO:239) 
SEQ ED NO:240) 
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cagcgcgttgtgcaccgggacttggccgcccggaacgtgctcgtggac 
Q R V V H R D L A A R N V L V D 

cggaactacattcacagagatctggctgccagaaatgtcctcgttggt 
R N Y I H R D L A A R N V L V G 

aagaatttcatccatagagatcttgcagctcgtaactgcctagtggga 
K N F I H R D L A A R N C L V G 

aacagcttcatccacagagatctggctgccagaaattgtctagtaagt 
N S F I H R D L A A R N C L V S 

aatggctatattcatagggatttggcggcaaggaattgtttggtcagt 
N G Y I H R D L A A R N C L V S 

gcatgtgtcatccacagagacttggctgccagaaattgtttggtggga 
A C V I H R D L A A R N C L V G 

caccaattcatacaccgggacttggctgctcgtaactgcttggtggac 
H Q F I H R D L A A R N C L V • D 

aagcagttccttcaccgagacctggcagctcgaaactgtttggtaaac 
K Q F L H R D L A A R N C L V N 

cacaattatgtccaccgggacctggctgccagaaacatcttggtgaat 
HNYVHRDLAARNI LVN 



SEQIDNP:241) 
SEQE)NO:242) 

SEQIDNO:243) 
SEQ ID NO:244) 

SEQIDNO:245) 
SEQE)NO:246) 

SEQIDNO:247) 
SEQIDNO:248) 

SEQIDNO:249) 
SEQIDNO:250) 

SEQIDNO:251) 
SEQIDNO:252) 

SEQEDNO:253) 
SEQIDNO:254) 

SEQ ID NO:255) 
SEQ ID NO:256) 

SEQ ID NO:257) 
SEQ ID NO:258) 



Signature Moti f (SEP ID NO:286) : 



H R D L A A R 



Consensus Sequence (SEP ID NO:294) : 

CA(C/T) (A/C)G(A/C/G/T) GA(C/T) ( T/C) T (A/C/G/T) GC (A/C/G/T) 
H R D L A 



GC (A/C/G/T) ( A/QG (A/C/G/T) 
A R 



Complexity: 2 4 x 4 5 = 16 x 512 = 8,192 members 
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Variant 6: 3 members 

agggaagtcatccacaaagacctggctgccaggaactgtgtcattgat (SEQ ID NO:259) 

R E V I H K D L A A R N C V I D (SEQ ID NO: 260) 

aaccgctttgtgcataaggacttggctgcgcgtaactgcctggtcagt (SEQ ID NO:261) 

N R F V H K D L A A R N C L V S (SEQ ID NO: 262) 

cacttctttgtccacaaggaccttgcagctcgcaatattttaatcgga (SEQ ID NO:263) 

H F F V H K D L A A R N I L I G (SEQ ID NO: 264) 

Signature Moti f (SEP ID NO:287) : H K D L A A R 
Consensus Sequence (SEP ID NO:295) : 

CA(C/T) AA (A/G) GAC ( C/T)T(G/T) GC (A/T) GC(C/G/T) A/C)G(C/G/T) 
H " K D L A A R 

Complexity: 2 6 x 3 2 = 64 x 9 = 576 members 
Variant 7: 2 members 

aatcacttcatccacagggatattgccgcccggaactgcctgctgagc (SEQ ID NO:265) 
N H F I H R D I A A R N C L L S (SEQ ID NO: 266) 

aaccacttcatccaccgagacattgctgccagaaactgcctcttgacc (SEQ ID NO:267) 
N H F I H R D I A A R N C L L T (SEQ ID NO:268) 

Signature Moti f (SEP ID NO:288) : H R D I A A R 
Consensus Sequence (SEP ID NP:296) : 

CAC (A/C)G(A/G) GA(C/T) ATT GC(C/T) GCC (A/C)G(A/G) 

H R D I A A R 

Complexity: 2 5 ^=-33 2 6 = 64 members 
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Example 3 

Family of human nuclear hormone receptors (ZnF_C4 domain) - 45 members divided into 
9 groups 

In this example, the 45 known members of the nuclear hormone receptor family are 
divided into 9 subgroups. The same segment of the Zinc Finger_C4 domain described in 
Example 1 was used to design individual signature motifs and consensus sequences for each of 
the 9 subgroups. As in Example 1, the consensus sequence was "reverse translated" utilizing 
only those codons that encode the signature motif region of known members of the subgroup. 
Division of the family into subgroups dramatically reduces the complexity from 10,616,832 (see 
Example 1) to 1,664. 



Variant 1: 9 members 

tataatgcactgacctgtgaggggtgtaaaggtttcttcaggaga 

Y N A L T C E G C K G F F R R 

tacggcgtgcgcacctgtgagggctgcaaaggcttctttaagcgc 

Y G V R T C E G C K G F F K R 

tacggcgtgcgcacctgtgagggctgcaaaggcttctttaagcgc 

Y G V R T C E G C K G F F K R 

tacggcgtgcgaacctgcgagggctgcaagggctttttcaagaga 

Y G V R T C E G C K G F F K R 

tatggtgtccgcacatgtgagggctgcaagggcttcttcaagcgc 

Y G V R T C E G C K G F F K R 

tatggagcagtaacttgtgaaggctgcaaaggattttttaaaaga 

Y G A V T C E G C K G F F K R 

tacggggttatcacctgtgaggggtgcaagggcttcttccgccgg 

Y G V I T C E G C K G F F R R 

tacggagtcatcacatgtgaaggctgcaagggattctttaggagg 
YGVITCEGCKGFFRR 



SEQ ID NO: 1) 
SEQIDNO:2) 

SEQIDNO:3) 
SEQE)NO:4) 

SEQ ID NO:5) 
SEQ ID NO:6) 

SEQIDNO:7) 
SEQIDNO:8) 

SEQ ID NO:9) 
SEQ ID NO: 10) 

SEQ ED NO: 11) 
SEQ ID NO: 12) 

SEQ ID NO: 13) 
SEQ ID NO: 14) 

SEQ ID NO: 15) 
SEQ ID NO: 16) 
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tatggtgtcattacatgtgaaggctgcaagggctttttcaggaga (SEQ ID NO:17) 

Y G V I T C E G C K G F F R R (SEQIDNP:18) 

Signature Moti f (SEP ID NO:297) : T C E G C K G 
Consensus Sequence (SEP ID NO:298) : 

A-C- (A/C/T) T-G- (C/T) G-A- (A/G) G-G- (C/G) T-G- (C/T) 

T ~ C E G C 

A-A- (A/G) G-G- (A/C/T) 
K G 

Complexity: 2 5 x 3 2 = 32 x 9 = 288 
Variant 2: 9 members 

tatggagtgtacagctgcgaggggtgcaagggcttcttcaagcgg (SEQ ID NP:19) 

Y G V Y S C E G C K G F F K R (SEQ ID NP:20) 



tacggggtttacagctgtgagggttgcaagggcttcttcaaacgc (SEQ ID NP:21) 

Y G V Y S C E G C K G F F K R (SEQ ID NP:22) 

tacggggtatacagttgtgaaggctgcaaagggttcttcaagagg (SEQ ID NP:23) 

Y G V Y S C E G C K G F F K R (SEQ ID NP:24) 

tacaacgtgctcagctgcgaaggctgcaagggcttcttccggcgc (SEQ ID NP:25) 

Y N V L S C E G C K G F F R R (SEQIDNP:26) 

tacaatgttctgagctgcgagggctgcaagggattcttccgccgc (SEQ ID NP:27) 

Y N V L S C E G C K G F F R R (SEQ ID NP:28) 

tatgggatcatctcctgtgagggctgcaaagggtttttcaagcgg (SEQ ID NP:29) 

Y G I I S C E G C K G F F K R (SEQ ID NP:30) 

tatggggtcagctcttgtgaaggctgcaagggcttctttcgccga (SEQ ID NP:31) 

Y G V S S C E G C K G F F R R (SEQ ID NP: 3 2) 
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tatggggtcagctcttgtgaaggctgcaagggcttctttcgccga (SEQ ID NO:33) 

Y G V S S C E G C K G F F R R (SEQJDNP:34) 

tatggggctgtcagttgtgaaggttgcaaaggtttcttcaaaagg (SEQ ID NO:35) 

Y G A V S C E G C K G F F K R (SEQ ID NO: 36) 

Signature Moti f ("SEP ID NP:299) : S C E G C K G 
Consensus Sequenc e (SEP ID NP:300) : 

(A/T) - (C/G) - (C/T) T-G- (C/T) G-A- (A/G) G-G- (C/G/T) T-G-C 

S C E G C 

A-A- (A/G) G-G- (A/C/G/T) 
K ~~ G 

Complexity: 2 6 x 3 x 4 = 64 x 3 x 4 = 768 
Variant 3: 2 members 

tacggtgtcttcacctgcgagggctgcaagagctttttcaagcga (SEQ ID NP:37) 

Y G V F T C E G C K S F F K R (SEQ ID NP:38) 

tacggccagttcacgtgcgagggctgcaagagcttcttcaagcgc (SEQ IDNP:39) 

Y G Q F T C E G C K S F F K R (SEQ ID NP.40) 

Signature Moti f (SEP ID NP:301) : T C E G C K S 
Consensus Sequence (SEP ID NP:302) : 

A-C- (C/G) T-G-C G-A-G G-G-C T-G-C A-A- (A/G) A-G- (C/T) 

T C E G C K S 

Complexity: 2 3 = 8 
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tacggggtctacgcctgcgacggctgctcaggttttttcaaacgg (SEQ ID NO:41) 

Y G V Y A C D G C S G F F K R (SEQ ID NO: 42) 

tatggcatctatgcctgcaacggctgcagcggcttcttcaagagg (SEQ ID NO:43) 

Y G I Y A C N G C S G F F K R (SEQ ID NO:44) 



Signature Moti f (SEP ID NP:303) : A C D/N G C S G 



Consensus Sequence fSEP ID NO:304) : 

G-C-C T-G-C (A/G) -A-C G-G-C T-G-C (A/T) - (C/G) - (A/C) 
A C D/N G C S 

G-G- (C/T) 
G 

Complexity: 2 5 = 32 



Variant 5: 2 members 

tatggggcatccacctgtgatgggtgcaagggtttcttcagacgc (SEQ ID NO:45) 

Y G A S T C D G C K G F F R R (SEQ ID NO:46) 

tacggtgcctcgagctgtgacggctgcaagggcttcttccggagg (SEQ ID NO:47) 

Y G A S S C D G C K G F F R R (SEQ ID NO:48) 



Signature Moti f f SEP ID NO:305V T/SCDGCKG 



Consensus Sequence (SEP ID NO:306) : 

A- (C/G) -C T-G-T G-A- (C/T) G-G- (C/G) T-G-C A-A-G 
S/T C D G C K 

G-G- (C/T) 
G 

Complexity: 2 4 = 16 
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Variant 6: 7 members 

tatggggtcagcgcctgtgagggctgcaagggcttcttccgccgc 

Y G V S A C E G C K G F F R R 

tatggggtcagcgcctgtgagggatgtaagggctttttccgcaga 

Y G V S A C E G C K G F F R R 

tacggtgtgcacgcctgcgagggctgcaagggctttttccgtcgg 

Y G V H A C E G C K G F F R R 

tatggagttcatgcttgcgaaggctgtaagggtttctttcggaga 

Y G V H A C E G C K G F F R R 

tacggtgttcatgcatgtgaggggtgcaagggcttcttccgtcgt 

Y G V H A C E G C K G F F R R 

tacggagtccacgcgtgtgaaggctgcaagggcttctttcggcga 

Y G V H A C E G C K G F F R R 

tatggagttcatgcttgtgaaggatgcaagggtttcttccggaga 
YGVHACEGC K G F F R R 



SEQ ID NO:49) 
SEQ ID NO:50) 

SEQJDNO:51) 
SEQ ID NO:52) 

SEQ ID NO:53) 
SEQ ID NO:54) 

SEQ ID NO:55) 
SEQ ID NO:56) 

SEQ ED NO:57) 
SEQ ID NO:58) 

SEQ ID NO:59) 
SEQ ID NO:60) 

SEQIDNO:61) 
SEQ ID NO:62) 



Signature Moti f (SEP ID NO:307) : A C E G C K G 



Consensus Sequence (SEP ID NO:308) : 



G-C- (A/C/G/T) T-G- (C/T) G-A- (A/G) G-G- (A/C/G) T-G- (C/T) 
A C E G C 



A-A-G G-G- (C/T) 
K G 



Complexity: 2 4 x 3 x 4 = 16 x 12 = 192 



Variant 7: 6 members 

ttcaatgtcatgacatgtgaaggatgcaagggctttttcaggagg (SEQ ID NO:63) 
F N V M T C E G C K G F F R R (SEQ ID NO: 64) 
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tttaatgcgctgacttgtgagggctgcaagggtttcttcaggaga (SEQ ID NO:65) 

F N A L T C E G C K G F F R R (SEQ ID NP:66) 

taccgctgtatcacgtgtgaaggctgcaagggtttctttagaaga (SEQ ID NO:67) 

Y R C I T C E G C K G F F R R (SEQ ID NO:68) 

taccgctgtatcacttgtgagggctgcaagggcttctttcgccgc (SEQ ID NO:69) 

Y R C I T C E G C K G F F R R (SEQtoNP:70) 

tacggactgctcacgtgtgagagctgcaagggcttcttcaagcgc (SEQ ID NO:71) 

Y G L L T C E S C K G F F K R (SEQIDNP:72) 

tatgggctcctcacctgtgaaagctgcaagggattttttaagcga (SEQ ID NO:73) 

Y G L L T C E S C K G F F K R (SEQ ID NO: 74) 



Signature Moti f (SEP ID NO:309) : TCEG/SCKG 



Consensus Sequence (SEP ID NO:310) : 

A-C- (A/C/G/T) T-G-T G-A- (A/G) ( A/G) -G- (A/C) T-G-C A-A-G 
T C ~ E " G/S C K 

G-G(A/C/T) 
G 

Complexity: 2 3 x3x4 = 8x3x4 = 96 



Variant 8: 4 members 

tatggggtagtcacctgtggcagctgcaaagttttcttcaaaaga (SEQ ID NP:75) 

Y G V V T C G S C K V F F K R (SEQIDNP:76) 

tatggagctctcacatgtggaagctgcaaggtcttcttcaaaaga (SEQ ID NP:77) 

Y G A L T C G S C K V F F K R (SEQ ID NP:78) 

tatggtgtccttacctgtgggagctgtaaggtcttctttaagagg (SEQ ID NP:79) 

Y G V L T C G S C K V F F K R (SEQ ID NP: 80) 

tatggagtcttaacttgtggaagctgtaaagttttcttcaaaaga (SEQ ID NP:81) 

Y G V L T C G S C K V F F K R (SEQIDNP:82) 
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Signature Moti f (SEP ID NO:31 1) ; T C G S C K V 



Consensus Sequenc e (SEP ID NO:312) : 

A-C- (A/C/T) T-G-T G-G- (A/C/G) A-G-C T-G- (C/T) A-A- (A/G) 
T C G S C K 

G-T- (C/T) 
V 

Complexity: 2 3 x 3 2 = 8 x 9 = 72 
Variant 9: 4 members 

tacggcgtggcctcctgcgaggcttgcaaggccttcttcaagagg (SEQ ID NP:83) 

Y G V A S C E A C K A F F K R (SEQ ID NP: 84) 

tatggtgtggcatcctgtgaggcctgcaaagccttcttcaagagg (SEQ ID NP:85) 

Y G V A S C E A C K A F F K R (SEQIDNP:86) 

tatggagtctggtcctgtgagggctgcaaggccttcttcaagaga (SEQ ID NP:87) 

Y G V W S C E G C K A F F K R (SEQ ID NP:88) 

tatggagtctggtcgtgtgaaggatgtaaggccttttttaaaaga (SEQ ID NP:89) 

Y G V W S C E G C K A F F K R (SEQ ID NP:90) 

Signature Moti f (SEP ID NP:313) : S C E A/G C K A 
Consensus Sequenc e (SEP ID NP:314) : 

T-C- (C/G) T-G- (C/T) G-A- (A/G) G- (C/G) - (A/C/T) T-G- (C/T) 

S " C E A/G C 

A-A- (A/G) G-C-C 
K A 

Complexity: 2 6 x 3 = 64 x 3 = 192 

Total Complexity of library: the sum of the complexities of subgroups 1-9 = 1,664. 
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The library is constructed from the following semi-randomized oligonucleotides: 
Variant 1 (SEQ ID NO:269) 

5' -pCCAGGACGACAAAAAGACHTGYGARGGSTGYAARGGHCTTTTTAGGCTTTTCGG-3 ' 
Variant 2 (SEQ ID NO:270) 

5' -pCCAGGACGACAAAAAGWSYTGYGARGGBTGCAARGGNCTTTTTAGGCTTTTCGG-3 ' 
Variant 3 (SEQ ID NO:271) 

5' -pCCAGGACGACAAAAAGACSTGCGAGGGCTGCAARAGYCTTTTTAGGCTTTTCGG-3 ' 
Variant 4 (SEQ ID NO:272) 

5' -pCCAGGACGACAAAAAGCCTGCRACGGCTGCWSMGGYCTTTTTAGGCTTTTCGG-3 ' 
Variant 5 (SEQ ID NO:273) 

5' -pCCAGGACGACAAAAAGASCTGTGAYGGSTGCAAGGGYCTTTTTAGGCTTTTCGG-3 ' 
Variant 6 (SEQ ID NO:274) 

5 ' -pCCAGGACGACAAAAAGCNTGYGARGGVTGYAAGGGYCTTTTTAGGCTTTTCGG-3 ' 
Variant 7 (SEQ ID NO:275) 

5 ' -pCCAGGACGACAAAAAGACNTGTGARRGMTGCAAGGGHCTTTTTAGGCTTTTCGG- 3 ' 
Variant 8 (SEQ ID NO:276) 

5' -pCCAGGACGACAAAAAGACHTGTGGVAGCTGYAARGTYCTTTTTAGGCTTTTCGG-3 ' 
Variant 9 (SEQ ID NO:277) 

5 ' -pCCAGGACGACAAAAAGTCSTGYGARGSHTGYAARGCCTTTTTAGGCTTTTCGG-3 ' 

In the above, mixtures of nucleotides (wobbles) are denoted using the following standard 
nomenclature: 
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Table 2 



Wobble 


Nucleotides 


r> 




V 




TJ 


A J.PJ.T 
A+C+ 1 


K 


U+l 


M 




N 


A+C+G+T 


R 


A+G 


S 


C+G 


V 


A+OG 


W 


A+T 


Y 


C+T 



The semi-randomized oligonucleotides are resuspended in TE buffer and combined in 
direct proportion to their complexities to a final concentration of 0.92 jxM. One hundred eight 
pmol of the semi-randomized oligonucleotide mixture is combined with 21.6 pmol each of 
adapter oligonucleotides Univ-l(FseI) and Univ-2(AscI). 

Univ-UFseD : 5 ' - CTTTTTGTCGTCCTGGCCGG - 3 ' (SEQ ID NO:278) 
Univ-2(AscD : 5' - pCGCGCCGAAAAGCCTAAAAAG - 3 ' (SEQ ID NO:279) 

The oligonucleotides are annealed by heating to 70 °C for 5 minutes and slowly cooling 
to room temperature (-3 hours). The annealed oligonucleotides are ligated to 0.216 pmol of an 
Fsel/AscI-digested vector bearing opposing human U6 and murine U6 promoters. Construction 
of this vector is described in U.S. Patent Application Serial Number 10/626,512. The nucleotide 
sequence of the human U6 and murine U6 promoters between the TATA box and the 
transcription start site was modified to contain Fsel and AscI restriction sites, respectively, as 
indicated below: 
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Human U6/murine U6 Opposing Promoter Cassette 

(Fsel and AscI sites in lower case letters): 

GGATCCAAGCTTAAGGTCGGGCAGGAAGAGGGCCTATTTCCCATGATTCC 
TTCATATTTGCATATACGATACAAGGCTGTTAGAGAGATAATTAGAATTA 
ATTTGACTGTAAACACAAAGATATTAGTACAAAATACGTGACGTAGAAAG 
TAATAATTTCTTGGGTAGTTTGCAGTTTTAAAATTATGTTTTAAAATGGA 
CTATCATATGCTTACCGTAACTTGAAAGTATTTCGATTTCTTGGCTTTAT 
ATATCggccggccTCGAggcgcgccATATTTATAGTCTCAAAACACACAA 
TTACTTTACAGTTAGGGTGAGTTTCCTTTTGTGCTGTTTTTTAAAATAAT 
AATTTAGTATTTGTATCTCTTATAGAAATCCAAGCCTATCATGTAAAATG 
TAGCTAGTATTAAAAAGAACAGATTATCTGTCTTTTATCGCACATTAAGC 
CTCTATAGTTACTAGGAAATATTATATGCAAATTAACCGGGGCAGGGGAG 
TAGCCGAGCTTCTCCCACAAGTCTGTGCGAGGGGGCCGGCGCGGGCCTAG 

AGATGGCGGCGTCGGATCC (SEQ ID NO:280) 

Ligation is performed overnight at 16 °C. One-fifth of the ligation reaction is used to 
transform electrocompetent bacteria (DH12S), resulting in 10 6 - 10 7 cfu/(xg DNA. 

The relatively low complexity (1,664) permits the delivery of the resulting library to the 
host cells by transient transfection in a 96-well format. The library is arrayed by picking -4,000 
individual colonies and inoculating 750 ^1/well of TB media (containing appropriate antibiotics) 
in 2-ml deep well 96-well plates (VWR). Following incubation for 20 hours, the cultures are 
pooled in groups of 10. DNA minipreps (Qiaprep Spin Miniprep Kits, Qiagen) are prepared 
from 1.5 ml of pooled bacterial culture. (The remainder of each culture is aliquotted and frozen 
for future use.) The purified DNA from each pool is quantitated using Rediplate 96 PicoGreen 
dsDNA Quantitation Kits (Molecular Probes). DNA from each pool is diluted to 100 ng/nl and 
stored in 96-well plates. Each well contains DNA encoding up to 10 unique siRNAs. 
Transfection of target cells is performed in a 96-well format using standard methods.- 
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PATENT 



Please insert the accompanying paper copy of the Sequence Listing, page numbers 1 to 58, at the 
end of the application. 
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